Noise-Thresholding with Empirical Mode Decomposition for Low Distortion Speech Enhancement

نویسنده

  • Erhan Deger
چکیده

Degrading the quality and intelligibility of the speech signals, background noise is a severe problem in communication and other speech related systems. In order to get rid of this problem, it is important to enhance the original speech signal mainly through noise reduction. Speech enhancement is the term used to describe such algorithms and devices whose purpose is to improve some perceptual aspects of the speech for the human listener or to improve the speech signal so that it may be better exploited by other speech processing algorithms. Development and widespread deployment of digital communication systems during the last twenty years have brought increased attention to the role of speech enhancement in speech processing problems. For this purpose, this thesis presents novel speech enhancement methods based on applying some thresholding strategies in Empirical Mode Decomposition (EMD) domain. Since speech signals are nonlinear and non-stationary in nature, the performance of related studies is significantly dependent on the analysis method. Although Fourier transform and wavelet analysis made great contributions, they suffer from many shortcomings in the case of nonlinear and non-stationary signals. The EMD, recently been pioneered by Huang et. al. as a new and powerful data analysis method for nonlinear and non-stationary signals has made a novel and effective path for speech enhancement studies. Basically, EMD is a data-adaptive decomposition method with which any complicated data set can be decomposed into zero mean oscillating components, named intrinsic mode functions (IMFs). Such functions give sharp and meaningful identifications of instantaneous frequencies. Recent studies have shown that with EMD, it is possible to successfully identify the noise components from the IMFs of the noisy speech. For instance, in case of white noise, most of the noise components of a noisy speech signal are centered on the first three IMFs due to their frequency characteristics. Thresholding is a widely used process in noise reduction algorithms. The idea is to determine a threshold value and to apply different subtraction algorithms for the segmented regions. However, it is never easy to identify and remove the noise components while keeping the original speech components non-degraded. That is why; one of the major drawbacks of these kinds of processes is the degradation of the speech signal, especially in the process of noisy signals with high signal-to-noise ratios (SNR). In order to minimize the degradation of the original speech components, a modified soft-thresholding strategy that works on a frame basis is adapted in this study. The IMFs of the noisy speech signal are denoised by applying the modified soft-thresholding strategy on the coefficients of each IMF. With the proposed strategy, most of the noise components are successfully removed while the speech components are mainly kept. This strategy enables even signals with high SNRs to be processed effectively. It is never possible to remove all the noise components in a noise reduction method. The remaining noise parts may result in an irritating sound which is referred as the musical noise. That is why; most speech enhancement algorithms not only introduce speech distortion but also suffer from the musical noise artifact. The proposed EMD based algorithm is highly effective in noise removal and introduces a rather discrete noise than a continuous musical sound. In order to obtain even better results for speech quality and quantity, the method was further improved by introducing a Discrete Cosine Transform (DCT) based thresholding as a first stage. The twostage algorithm gives efficient results, successfully improving the SNR of the noisy speech and removing most of the noise components. The thesis work mainly concentrates on white noise; however the method has been further improved with a sub-band approach so that it may be applied to different noise types.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech enhancement using empirical mode decomposition and the Teager-Kaiser energy operator.

In this paper a speech denoising strategy based on time adaptive thresholding of intrinsic modes functions (IMFs) of the signal, extracted by empirical mode decomposition (EMD), is introduced. The denoised signal is reconstructed by the superposition of its adaptive thresholded IMFs. Adaptive thresholds are estimated using the Teager-Kaiser energy operator (TKEO) of signal IMFs. More precisely,...

متن کامل

Speech E Ha Ceme T Usi G Soft Thresholdi G with Dct-emd Based Hybrid Algorithm

This paper introduces a new speech enhancement method using soft thresholding with a Discrete Cosine Transform (DCT) and Empirical Mode Decomposition (EMD) based hybrid algorithm. Soft thresholding for DCT-enhancement is a powerful method for enhancing the noisy speech signal in a wide range of signal-to-noise ratios (S Rs). However, due to the thresholding criteria a significant amount of nois...

متن کامل

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

Speech Enhancement Using EMD Based Adaptive Soft-Thresholding (EMD-ADT)

This paper presents a novel algorithm of speech enhancement using data adaptive softthresolding technique. The noisy speech signal is decomposed into a finite set of band limited signals called intrinsic mode functions (IMFs) using empirical mode decomposition (EMD). Each IMF is divided into fixed length subframes. On the basis of noise contamination, the subframes are classified into two group...

متن کامل

EMD based soft-thresholding for speech enhancement

This paper introduces a novel speech enhancement method based on Empirical Mode Decomposition (EMD) and softthresholding algorithms. A modified soft thresholding strategy is adapted to the intrinsic mode functions (IMF) of the noisy speech. Due to the characteristics of EMD, each obtained IMF of the noisy signal will have different noise and speech energy distribution, thus will have a differen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008